Metrical-Accent Aware Vocal Onset Detection in Polyphonic Audio

Authors

  • Georgi Dzhambazov
  • Andre Holzapfel
  • Ajay Srinivasamurthy
  • Xavier Serra
Abstract

The goal of this study is the automatic detection of onsets of the singing voice in polyphonic audio recordings. Starting from the hypothesis that knowledge of the current position in a metrical cycle (i.e. the metrical accent) can improve the accuracy of vocal note onset detection, we propose a novel probabilistic model that jointly tracks beats and vocal note onsets. The proposed model extends a state-of-the-art model for beat and meter tracking, in which the a priori probability of a note at a specific metrical accent interacts with the probability of observing a vocal note onset. We carry out an evaluation on a varied collection of multi-instrument datasets from two music traditions (English popular music and Turkish makam) with different types of metrical cycles and singing styles. Results confirm that the proposed model reasonably improves vocal note onset detection accuracy compared to a baseline model that does not take metrical position into account.
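The interaction between a metrical-accent prior and an onset observation can be illustrated with plain Bayes' rule. The sketch below is not the model from the paper, which jointly tracks beats and onsets; it only shows, for a single frame, how a position-dependent prior could shift the onset probability. The function names and the prior values in `ONSET_PRIOR_BY_BEAT` are hypothetical and chosen purely for illustration.

```python
def onset_posterior(p_obs_given_onset, p_obs_given_no_onset, onset_prior):
    """Bayes' rule for one frame: P(onset | observation).

    onset_prior is the assumed a priori probability of a note onset
    at the frame's metrical position.
    """
    numerator = p_obs_given_onset * onset_prior
    denominator = numerator + p_obs_given_no_onset * (1.0 - onset_prior)
    return numerator / denominator if denominator > 0 else 0.0


# Hypothetical priors for the four beats of a 4-beat cycle
# (illustrative values, not taken from the paper).
ONSET_PRIOR_BY_BEAT = [0.6, 0.2, 0.4, 0.2]


def posteriors_over_cycle(p_obs_given_onset, p_obs_given_no_onset,
                          priors=ONSET_PRIOR_BY_BEAT):
    """Apply the same observation likelihoods at every beat of the cycle."""
    return [
        onset_posterior(p_obs_given_onset, p_obs_given_no_onset, prior)
        for prior in priors
    ]


if __name__ == "__main__":
    # Identical acoustic evidence at every beat; only the prior differs.
    for beat, post in enumerate(posteriors_over_cycle(0.7, 0.3), start=1):
        print(f"beat {beat}: P(onset | observation) = {post:.3f}")
```

With identical acoustic evidence at every beat, the posterior is highest where the assumed prior is highest, which is the intuition behind using metrical position as side information.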

Similar resources

A Psychoacoustically Motivated Sound Onset Detection Algorithm for Polyphonic Audio

We propose an algorithm for sound onset detection applying principles of psychoacoustics. A popular model of loudness perception in human auditory system is used to compute a novelty function that allows for a more robust detection of onsets. The psychoacoustics paradigm also allows us to define thresholds for the novelty function that are both physically and perceptually meaningful and hence e...

Mirex2014: Audio Melody Extraction

This paper describes our submission for the audio melody extraction task of the Music Information Retrieval Evaluation eXchange (MIREX 2014). Our algorithm first separates the vocal spectra from polyphonic sound spectra. Melody extraction and vocal activity detection are applied to the separated spectra.

Handling Asynchrony in Audio-Score Alignment

Aligning a canonical score to an audio recording of a musical performance can provide very good information about the timing of individual notes. However, a score representation frequently treats multiple note events as simultaneous, whereas in reality different performers will start notes at slightly differing times, and these timing details may be significant in the analysis of performance an...

A New Method for Musical Onset Detection in Polyphonic Piano Music

In this paper, we propose a musical onset detection method, with reference to polyphonic piano music. The solution proposed consists of an onset detection algorithm based on Short-Time Fourier Transform (STFT) and Non-Negative Matrix Factorization (NMF). This method operates on a frame-by-frame basis and exploits a suitable binary time-frequency representation of the audio signal. To validate t...

Timbre and Melody Features for the Recognition of Vocal Activity and Instrumental Solos in Polyphonic Music

We propose the task of detecting instrumental solos in polyphonic music recordings, and the usage of a set of four audio features for vocal and instrumental activity detection. Three of the features are based on the prior extraction of the predominant melody line, and have not been used in the context of vocal/instrumental activity detection. Using a support vector machine hidden Markov model w...

Publication date: 2017